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In this paper we investigate the classical communication cost of converting between different forms 
of bipartite pure state entanglement in the many copy case. This problem is usually broken into two 
parts: concentrating the entanglement from many partially entangled states into a smaller number 
of singlets and the reverse process of diluting singlets into partially entangled states. 

Entanglement concentration requires no classical communication, but the best prior art result for 
diluting to A'^ copies of a partially entangled state requires an amount of communication on the order 
of v'^. Our main result is to prove that this prior art result is optimal up to a constant factor; any 
procedure for approximately creating A'' partially entangled states from singlets requires f2(\/iV) 
bits of classical communication. Previously not even a constant bound was known for approximate 
entanglement transforms. 

We also prove a lower bound on the inefficiency of the process: to dilute singlets to copies of 
a partially entangled state, the entropy of entanglement must decrease by 



I. BACKGROUND 

A useful paradigm in quantum information processing 
is the resource model where various entities including 
classical communication cost, quantum communication 
cost and entanglement are regarded as different funda- 
mental resources. For a resource model to make sense, 
different forms of the same resource need to be inter- 
convertible or fungible in the asymptotic limit. For ex- 
ample, many independent copies of any classical random 
variable with entropy H can be compressed asymptoti- 
cally losslessly to H bits per copy Similarly, a quan- 
tum state p can be compressed to S(p) qubits in the many 
copy limit 0, p. 

In this paper, we will discuss the resource of entan- 
glement: specifically, bipartite pure state entanglement. 
This sort of entanglement was proposed as asymptoti- 
cally fungible (and thus a resource) by Bennett, Bern- 
stein, Popescu and Schumacher (BBPS) If \'4')ab 
a bipartite pure state with entropy of entanglement E = 
SiTTB\^){^\) and 1$) is the singlet state ^(|01)-|10)) 

BBPS explained how to approximately convert |$ 
into and back again, using local operations and 

classical communication (LOCC) only. The transforma- 
tions, known as entanglement concentration and dilution, 
are only asymptotically reversible; in each direction we 
accept some inefficiency (so that instead of converting 
1$)"^ to IV")", we need to start with 1$)"^+°^"-') and a 
small error e G o(l) (so instead of obtaining the state 
IV')", we get p with Tr|p- |'0)(V'ri < e)- 

An additional feature of the BBPS protocol for en- 
tanglement dilution is a classical communication cost of 
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0{n) bits. In contrast, their entanglement concentration 
procedure requires no communication at all. For some 
applications such as super-dense coding paying an 
0{n) classical communication cost would mean that the 
utility of entanglement for different tasks depended on 
its form. If there were scenarios in which different forms 
of pure entanglement were not asymptotically equivalent, 
then the resource model of entanglement could become 
much more complicated. Fortunately, in 1999 Lo and 
Popescu showed that the original dilution protocol 
could be modified to require only o{n) bits of classical 
communication, while the error and inefhciency remained 
o(l) and o{n) respectively. Thus, any two states with 
the same entropy of entanglement are asymptotically in- 
terconvertible, even if we take into account the cost of 
classical communication. The specific dilution procedure 
of used 0{y/n) bits, but left open the question of 
whether this could be improved. The main result of 
this paper proves that no such improvement is possi- 
ble. In other words, we will show that n{y/n) bits of 
classical communication are necessary for entanglement 
dilution. Previously it was known that some exact entan- 
glement transformations were only possible with a non- 
zero amount of classical communication, but for approx- 
imate transformations no bounds were known. We also 
prove a similar lower bound on inefficiency; creating n 
states, each of entanglement E, requires starting with 
nE + n{y/n) singlets to achieve 0(1) error. 

A main motivation of our study is to understand fluc- 
tuations in a finite system. For entanglement manipu- 
lation, these take the form of probability of failure, im- 
perfect fidelity, suboptimal yield and required classical 
communication. Given that any physical system must 
have only a finite number of constituents, it is important 
to understand the magnitude and origin of those prob- 
lems. That is to say: How quickly does a finite-copy 
system approach its thermodynamic/asymptotic limit as 
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its number of copies increases? As we will see later, some 
of those problems are related to the fact that when dif- 
ferent states are not related to each other in trivial ways, 
we can only approximate them. Others are related to the 
spectrum of the Schmidt coefRcients. 

Errors and the probability of failure are results of dis- 
cretization and can be made exponentially small in n. In 
contrast, we will show that the inefhciency and the classi- 
cal communication cost of entanglement dilution are nec- 
essarily n{y/n) because they stem from differences in the 
shapes of spectra of the Schmidt coefficients of different 
states. We will make this notion precise later, by defin- 
ing the significant subspace (or (5-significant subspace) of 
a density matrix p to be a subspace that contains at least 
S of the weight of p for some 0(1) constant S ||2^. This 
definition generalizes typical subspaces, which are usually 
defined as containing almost all the weight of a density 
matrix. We will show that, for a generic p that is nei- 
ther pure nor maximally mixed, the ratio of the size of a 
typical subspace of to that of a (5-significant subspace 
(say with 3=1/4) of p" is large, namely 2^^^). It is the 
logarithm of this ratio that gives rise to the fundamental 
constraint — that the inefficiency and classical communi- 
cation cost of entanglement dilution have lower bounds 

of f7(v^). 

To put these results in perspective, it is worth noting 
that the entanglement dilution and concentration proto- 
cols of BBPS both reduce the entropy of entanglement by 
O(y^). In both cases, this amount of inefRciency turns 
out to be optimal. We will prove the dilution bound in 
section and for concentration, [jl^ proved that o(l) er- 
ror requires an inefficiency of n{^/n) |2^. Similarly, both 
classical and quantum data compression require Q{^/n) 
more space (either bits or qubits) to compress a source 
than would be implied by the entropy of the source . 

This paper is organized as follows. Section II contains 
the formal statement of our main result — that entangle- 
ment dilution necessarily requires f2(-\/n) classical bits 
of communication and an inefficiency of il.{y/n). In the 
next two sections, we present useful intermediate results: 
In Section III, we show that a general strategy of en- 
tanglement dilution can be equivalently rephrased as a 
much simpler one. In Section IV we define ^-significant 
subspaces and study their properties. Our main result 
is proved in Section ^, and Section VI contains some 
concluding remarks and discussions. 



II. STATEMENT OF THE MAIN RESULT 

If a partially entangled bipartite pure state has 
entropy of entanglement E, then lip)"' can be approxi- 
mately prepared by two distant parties, Alice and Bob, 
from roughly uE singlets using only LOCC. The main 
result of the present paper is that any such dilution pro- 
cedure must use n{y/n) bits of classical communication. 
Along the way, we will also prove that dilution protocols 
cannot be perfectly efficient, and inevitably waste 



bits of entanglement. The formal statement of our main 
result is as follows: 

Theorem 1 Let ji/i) e "Hab be a bipartite pure state 
that is neither separable nor maximally entangled with 
its entropy of entanglement E = S {Tr gl^p) {ip\) where 
S {TrB\ip){ip\) is the entropy of a reduced density ma- 
trix. Let = 0, maximally entangled 
state of dimension d. Then there exist a universal con- 
stant £o and constants a and uq that depend on lip) such 
that Ve < eo; Vn > no, any entanglement dilution proto- 
col transforming \^d) into ji/')" with error e, probability 
of success 2^" and using c bits of classical communication 
must have 

a) log d > uE -\- a^/n and 

b) c + s > a^/n. 

This establishes the lower bounds as strongly as possi- 
ble, by requiring only a constant bound on the error and 
disallowing the possibility of a trade-off between ineffi- 
ciency and classical communication cost. In the above 
Theorem, we have made use of the following definitions: 

Definition of the trace distance: The trace distance is 
defined as 



D{p,a) = Tv\p-a\. 
We will often use the equivalent formulation 



D{p,a) = 2maxTr(P(p 



^)) 



(1) 



(2) 



where P is a projector ||^. It is important not to confuse 
this distance with the matrix norm, which we define in 
the usual way. 

Definition of the norm of a matrix: By the norm of 
a matrix A, denoted ||^||, we mean its largest singular 
value. Equivalently, \\A\\ = sup|^|^i 

Definition of error: By error e, we mean that upon 
success the protocol outputs not but some possibly 
mixed state a with D{a, lip) ) = e. For this proof we 
require only that the protocol has an error e smaller than 
a universal constant eo ~ 0.01. 

Definition of failure: Failure, on the other hand, means 
that sometimes Alice's measurement yields a state that 
is far from lip) . 

Remark: Our lower bound on classical communication, 
stated in Part b) of Theorem |l|, still holds even if the 
probability of a protocol succeeding is vanishingly small 
(i.e. 2~°*^^)). At first glance this might seem to be a 
surprisingly strong result. Why should our bound apply 
so broadly to probabilistic protocols? When we prove 
Theorem |^ (in Section ^) we will find that the commu- 
nication bound is independent of the amount of prior en- 
tanglement used. As a result, it is possible to convert any 
probabilistic protocol into a nearly deterministic protocol 
and vice versa in the following manner: 

Suppose there exists a protocol with a 2~* probabil- 
ity of success (and c bits of classical communication). 
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We will show that it is always possible to perform a 
modified protocol with 1 — e probability of success and 
c + s + 0(loglog(l/e)) bits of communication, at the cost 
of massively increasing the inefficiency (i.e., the loss of en- 
tanglement). To implement the modified protocol, Alice 
performs her measurement 2*0(logl/e) times on differ- 
ent inputs and with « 1 — e probability she will succeed 
on at least one of them. She can then transmit the index 
of the successful block in s-|-0(loglog(l/e)) bits and send 
the c-bit measurement result corresponding only to the 
successful outcome while discarding the failures. 

Conversely, c bits of classical communication can al- 
ways be eliminated by having Bob guess the message Al- 
ice would have sent with a probability of success. (In 
this way, Alice needs to send Bob at most one additional 
bit to inform him whether his guess is correct.) Thus it is 
in general impossible to bound either success probability 
or classical communication cost independently. Instead, 
Part b) of Theorem |l| gives a tradeoff between success 
probability and classical communication cost. 



III. REDUCING ENTANGLEMENT 
MANIPULATION PROTOCOLS TO A 
STANDARD FORM 

A general strategy for entanglement manipulation may 
involve two-way classical communication between Alice 
and Bob. Suppose Alice and Bob have some method of 
performing an approximate entanglement manipulation 
procedure using local operations and c total bits of classi- 
cal communication (in either direction) . In entanglement 
dilution, for example, Alice and Bob begin sharing some 
number of perfect singlets and with high probability end 
with a mixed state that approximates many copies of a 
partially entangled state. In this section, we will simplify 
the description of any such entanglement manipulation 
procedure. 

Claim 2 Given a pure bipartite initial state and any 
LOCC entanglement manipulation protocol by Alice and 
Bob that involves no more than c bits of classical commu- 
nication, there exists an equivalent strategy (meaning it 
uses the same amount of communication, takes the same 
inputs and has the same output distribution) consisting 
of the following: 

1. Alice performs a generalized measurement {Mk\ 
with no more than 2"^ outcomes on her half of the 
input. 

2. She transmits the result to Bob using c bits of clas- 
sical communication. 

3. Bob performs a unitary operation Uk conditioned 
on the result. 

4- Both sides discard ancillary systems. 



A similar claim was proved in p7[ : it was shown that 
any LOCC entanglement manipulation strategy that be- 
gins and ends with pure states can be reduced to one that 
uses only one-way communication. 

However, we need to address two new subtleties here. 
First, we need to consider approximate entanglement 
transformations which begin in pure states but can end 
in mixed states. Second, does not specifically ad- 
dress the issue of classical communication cost. In our 
modified protocol we need Alice to transmit her entire 
measurement outcome using no more classical communi- 
cation than the original protocol. 

Proof Local operations can be broken into unitary 
transforms, measurements, adding ancilla systems, and 
discarding ancillas (22j. Without loss of generality we 
can add all the ancilla systems at the beginning of the 
protocol and discard all the subsystems at the end. 

Simplifying measurements and classical communica- 
tion is more complicated. Consider any measurement 
performed in a dilution protocol. By Neumark's theorem 
[|20| , we can convert such a measurement into a unitary 
operation followed by a projective measurement in the 
computational basis. This can be thought of as coupling 
the system to a measurement apparatus and then record- 
ing the state of the measuring device. Denote the effects 
of a projective measurement (on a single qubit) by a su- 
peroperator, $. It has Kraus operators |0)(0| and |1)(1|. 
We can also represent the classical channel between Al- 
ice and Bob as a quantum channel that maps input p to 
output $(p). 

At this point, the most general protocol is equivalent 
to one in which Alice performs a unitary operation, ap- 
plies $ some number of times, and transmits some bits 
via the classical channel; then Bob performs some uni- 
tary operation conditioned on the message, applies $ and 
sends some qubits through the classical channel; and so 
on. Conditioning a unitary operation on a measurement 
outcome M can be written as a single unitary matrix 

\0){0\(E)U +\1){1\(E)V (3) 

where U and V are unitary gates operating on the tar- 
get system. Since we never rewrite the qubits storing 
measurement outcomes and only use them for classical 
control of the rest of the system, the only gates we will 
apply to them will be of the form in Eq. (^ . From Fig- 
ure it is easy to see that these commute with $. 

Thus, without loss of generality we can defer every 
application of $ until the end of the protocol. 

This seemingly trivial step turns out to be equivalent 
to requiring Alice and Bob to communicate their com- 
plete measurement outcomes. To see this, note that un- 
til the final discarding of ancillas, $ is only applied to 
the bits that Alice and Bob actually send to each other. 
Equivalently, Alice and Bob only perform projective mea- 
surements on qubits and they always report their mea- 
surement outcomes. This means that both parties always 
know the joint state exactly until the final step when they 
discard ancillas. For this reason, we can deal with pure 
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FIG. 1: Demonstration that $ commutes with the gate in 
Eq. (^). The top Una is an ancilla used to iUustrate how 
$ can be performed, the middle line holds the measurement 
outcome \M) and the bottom line represents the rest of the 
system which we act on. Since the two circuits are equivalent, 
it follows that $ commutes with any gates of the form in 

Eq. a. 



states only, a situation sometimes referred as the Church 



of the larger Hilbert space |27 



Under these conditions, |l7| showed that any measure- 
ment performed by Bob where he communicates the out- 
come to Alice can be simulated by a measurement by 
Alice where she communicates the outcome to Bob (with- 
out changing classical communication cost). This result 
holds because the Schmidt decomposition of a pure state 
is always symmetric under the interchange of Alice and 
Bob. Therefore, without loss of generality, we can con- 
sider a reduced protocol where only Alice performs mea- 
surements and at the end communicates the entire mea- 
surement record to Bob. 

As a final simplification, we note that Alice can com- 
bine a whole sequence of measurements into a single mea- 
surement. 

In summary, given any fixed protocol for entanglement 
manipulation, we can construct an equivalent reduced 
protocol that consists of the following steps: a general- 
ized measurement by Alice, transmission of the complete 
measurement outcome to Bob, a unitary operation by 
Bob conditioned on the measurement and then discard- 
ing of ancillas on both sides. Moreover the modified pro- 
tocol uses the same amount of classical communication 
as the original protocol. ■ 



IV. SIGNIFICANT SUBSPACES OF A DENSITY 
MATRIX 

A typical subspace of a density matrix p is a vector 
space that contains most (i.e. 1 — o(l)) of the weight 
of p but in general has a dimension much smaller than 
the rank of p Q. For our proof we will introduce the 
related concept of a significant subspace, which is a vector 
space that contains a significant portion (meaning some 
0(1) constant) of the weight of p. Neither significant 
subspaces nor typical subspaces have properties that are 
unique to quantum information theory, but we will find 
it more convenient to state our definitions in terms of 
density matrices rather than probability distributions. 

Definition of a significant subspace: Let 11 be the pro- 
jector onto a finite-dimensional vector space V. For any 
density matrix p and < (5 < 1, we say that V is a 
significant subspace of p (or a (5-significant subspace) if 
Tr Hp > 5. 

For any S there can be many different significant sub- 
spaces and the only upper bound we can place on their 
dimension is rank p. However, we will find it useful to ex- 
amine the minimum dimension of any ^-significant sub- 
space for a matrix p. Denote this dimension by S{p, S) 
and define it by 

S{p,6) = {mmTTU\U^ =U,TtUp> 6} (4) 

In discussing bipartite entanglement, the rank of a den- 
sity matrix is often useful to work with because it cor- 
responds to the Schmidt number of an entangled state, 
which can never increase under LOCC. Unfortunately, in 
general a small perturbation can change the rank by an 
arbitrary amount. Significant subspaces are more sta- 
ble under perturbation, and we can use S{p, 6) to derive 
robust bounds on the rank. 

Proposition 3 // < (5 < 1 and p, a are density matri- 
ces with D{p, a) < 2(1 — S) then ranka > S{p, S) 

Proof Let P project onto the support of a. Then 
TrF = ranker and TrPcr = 1. Using Eq. (|), we find 
2(1 - 6) > D{p,a) > 2Trn(CT - p) for any projector U. 
Combining this with Tr Pa = 1, we obtain Tr Pp > 6. 
From the definition of S we have that S{p, S) < Tr P = 
ranker. ■ 

Significant subspaces are also preserved reasonably 
well under tensor product. 

Proposition 4 Let A and B be density matrices with 
Sa,Sb > and Sa + Sb <1- Then S{A(» B,6a + 5b) > 
S{A^B,6a + Sb-SaSb) > {S{A,6a)-1){S{B,Sb)-1). 

Remark: Proposition ^ applies to a density matrix that 
is a product of mixtures. Note that this is a more re- 
stricted condition than the requirement that the state is 
separable, which would have required only that the state 
be a mixture of products. See H] for a discussion. 
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Proof Let a = S{A, Sa) - 1 and 6 = S{B, 6b) - 1. De- 
fine a projector that projects onto the a eigenvectors 
of A with the highest eigenvalues and hkewise define Hb 
to project onto the b eigenvectors of B with the high- 
est eigenvalues. From the definition of S, we know that 
TtUaA < Sa and TyUbB < 5b- 

Now consider the orthogonal complements of T\a and 
II B- Every eigenvalue of (U — Ilyi)^ has at least a eigen- 
values of A greater than or equal to it. Likewise, every 
eigenvalue of(l — nB)i3 has at least b eigenvalues of B 
greater than or equal to it. Therefore, every eigenvalue 
of 

{{t-IlA)®{t-nB)){A®B) (5) 

has at least ab eigenvalues oi A®B greater than or equal 
to it. Equivalently, the ab highest eigenvalues of A® B 
correspond to eigenvectors in the support of 1 — (1 — 

n^)® (a-Hs). 

Furthermore, 

Tr(l-(l-nA)(»(Il-ns))(A(8)B) = (6) 
TtHaA + TyJIbB -TtI{aATtI{bB < Sa + Sb-Sa5b 

In the above, we make use of the fact that Tr HaA < 
Sa, TtUbB < 6b and that TrH^A + TxIVbB - 
Trn^ATrnsB is an increasing fmiction in both Tr 11/174 
and TrHsS. 

Thus, the largest ab eigenvectors of A^B have weight 
less than 6a + 6b~6a6b- This implies 

ab<S{A®B,6A'^6B~6A6B) < S{A(E)B,6a + 6b), (7) 
the desired result. ■ 



Proof A proof can be found in ||^ . ■ 

As a corollary, for any p and 6 there exists uq such 
that for n > no the left side of Eq. ^ is less than 6. 
Another useful consequence is that n copies of a state 
with entropy E have significant subspaces of dimension 

2n-E±0(Vn) 

Proposition 6 Let p, E, a and (3 he as in Lemma |^. 

Then there exist 6 < 1,C, ng such that 5(p",(5) > 
C2'^E+aV^ /or all n>na. 

Here 6 and C are universal constants and uq depends 
only on f3; one valid choice would he 6 = 0.95, C — 0.01 
and no = 10^/?^ 

Proof Let P be the projector onto the eigenvectors 
of p" with eigenvalues above 2~^^~^-^°'^ . Applying 
Lemma|yieldsTrPp" < 7V(-1.1, oo)-|-^ « 0M+^, 
which for some choice of no and 6 can be guaranteed 
to be less than 6. Since P picks out the largest eigen- 
vectors, it minimizes the dimension of a 5-significant 
subspace. Therefore, we have S{p"^,6) > TrP. Now, 
to lower-bound TrP, note that the weight of eigenval- 
ues of p" between 2-"-^-i i"v^ and 2""^-"^^ is at 
least iV(— 1.1, —1) — which we can make greater than 
some constant C > 0. Since the eigenvalues in this 
region are no greater than 2~"'^^"^ it follows that 
^2"B+"v^ < TrP < 5(p",(5). ■ 

Remark: Proposition |^ can be generalized to show that 
for any r > 0, there exist 6r < 1, C and no such that 
S{p'\6r) > C2"-^+™v^ for all n > no. 

We are now ready to prove our main result. 



One application of typical subspaces is to show that 
p" can be compressed to 2"^^^'^^^^^^ dimensions with 
asymptotically small error. This is because for large 
n, the spectrum of p" (for any p) approaches a Gaus- 
sian distribution and almost all eigenvalues are between 
2-nE-0{V^) 2-'^E+o(V^ pj.Q^g ^^lat improv- 

ing upon this result is impossible, we will examine the 
significant subspaces of p". 

First we will need to state central limit theorem in a 
manner that bounds the rate of convergence to Gaussian- 
ity. 



have 



eigenvalues 
2 _ 



Lemma 5 (Berry-Esseen) Let 

Pi,...,Pd and define E ^ -J^iPi^^gpi, a 
J2iPii^ogPi + EY (with a > Q) and (3 = Y.iPi\^°EPt " 
E\^ . Let p,(a,b) denote the sum of all eigenvalues of p 
between 2" and 2^. Then, for all a < b, 



fi{a,b) - N 



nE b + nE 



< 



25/3 



(8) 



where N{xi,X2) is the cumulative normal distribution 



N{xi,X2) 



e 2 dx 



(9) 



V. PROOF OF THE MAIN THEOREM 



A. The inefficiency bound 

Combining Proposition with Proposition ^ wc find 
that any state a with D(a,p^^) < 2(1 — 6) must have 
support on a space of dimension Jl(2"^+"^) |2^. This 
result allows us to prove that the inefficiency (loss of en- 
tanglement) bound applies to any dilution protocol. 
Proof [of part a) of Theorem [l[| Here we will not need 



the protocol reduction of Section HI . Instead, represent a 



general quantum operation as a map from pure states to 
ensembles of pure states. For example, the $ operation of 
Section III can be said to map the pure state a|0) -I- 6|1) 



to the ensemble {pi, |i)}i=o,i with po = |ap and pi — 
Now suppose that Ahce and Bob start with \^d), 
obtain a series of measurement outcomes that indicate 
the protocol has succeeded, and end with an ensemble 
of states {pijlffii}}- By "success" we mean that their 
resulting density matrix is close to the desired state; i.e. 

D{E^P.\^^){^^\,Wn<^■ 

Since the Schmidt number never has any chance of 
increasing, we must have Sch|(y9i) < d for all i. Further- 
more, since the trace distance is convex, there exists an i 
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for which D{\ipi), |V')"1< £■ Let p = TrB|^)(V'|, let E,a 
and (3 be as in Lemma || and choose 5, C and uq according 
to Proposition ^. Then choose eo such that eo < 2(1 — 5). 
Since tracing out Bob's system cannot increase the trace 
distance, we have D{Tv B\'^i){'fi\, p^) < e < 2(1 — 5). 
Again we apply Propositions ^ and || to find that 

C2"^+"^ < rankTrB|(^,)((^,| = Sch\ip,) < d (10) 

proving the desired result (up to an overall constant that 
we can absorb into a and no). B 



B. The communication bound 




leaving Alice and Bob with the pure state \xk)AA'BB'^ 
which can be written as 



(Mfe ® Uk) \<^d) 



Xk) 



(12) 



In section [II, we reduced an arbitrary dilution pro- 



tocol to one consisting of a generalized measurement by 
Alice, a local unitary by Bob conditioned on the result, 
and then the discarding of ancilla systems by both par- 
ties. 

Alice's measurement is the interesting step, since we 
can relate the spectrum of the measurement operator to 
the Schmidt coefficients of the resulting state. If the final 
state roughly resembles \'ip) , then the measurement out- 
come that produced it must also have a typical subspace 
(say of weight 0.99) with dimension 2^^^^ times larger 
than some significant subspace (of weight around 0.25). 
We will show that any such measurement outcome oc- 
curs with probability 2~^(v^) and thus that ^{y/n) bits 
of communication are necessary to tell Bob the result of 
Alice's measurement. 

To deal with the complication of discarding ancillas, 
we state one final lemma. 

Lemma 7 Consider a bipartite Hilhert space 'Hab with 
states 1-0) S Hab, \4') G Ha and 

i?(TrB|^)(V'|,|</))(0|) <e (11) 

Then 3\j) G Hb such that 

i^(IV'),l0)®l7))<2e 

In other words, if tracing out Hb leaves the state al- 
most pure, then the subsystems must have been almost 
separable to begin with. This is proved in Appendix 

Now we can proceed with the proof of the classical 
communication bound. 

Proof [of part b) of Theorem By Claim it suffices 
to consider an entanglement dilution procedure with the 
following form. 

Alice first performs a generalized measurement {M^} 
with 2'^ outcomes on her half of |$d) and transmits her c- 
bit measurement outcome k to Bob, who performs some 
unitary operation Uk conditioned on the result. The re- 
sult of the measurement is k with probability ^Tr MkMl, 



Then Alice and Bob trace out subsystems A' and B' , 
leaving the state Yfc = Ti A'B'\xk){xk\- When the proto- 
col succeeds (which occurs with probability > 2^*), then 
Yk should be close to the desired state |'0)". Specifically, 
we should have 



DiYk,\m< 



(13) 



Since there are at most 2"^ successful values of k (i.e. 
measurement outcomes for which Eq. (|l^) holds), there 
must exist at least one successful value of k that has 
probability > 2-('=+") of occuring. Equivalently, 



TiMkMl > 



2C+S 



(14) 



We will use this value of k implicitly for the rest of the 
proof, and refer to M, U, \x), Y and so forth instead of 
Mfe, Uk, \xk) and Yk. 

Now D(Y,\i;)'') = D{TiA'B'\x){x\,\il))'') < e. Thus 
we can use Lemma ^ to show that 3|7) e Ti-A'B' such 
that 



D{\x)Mr\^))<2e 



(15) 



Define p b\^P){'4^\, X = bb'\x){x\ and P = 

TrB'|7)(7|. Since tracing out a subsystem never in- 
creases distance between two states 



i:i(X,p"(g)r) < 2e 



(16) 



At this point, several different variables have been in- 
troduced to label different subsystems of |a:). To keep 
track of their relations to one another, the following dia- 
gram may be useful. 



\xk)^\i>r®\i) 

jTr^B' 
X « p" P 



Tr Ai ai 



Trv 



Yk 



Tr. 



T^A'X 
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Alice's reduced density matrix, X, turns out to have a 
simple expression in terms of M that will make it quite 
useful to work with. From Eq. (p^, 



TrMM'^ 



x){x\ = (M®C/)|$d)($<i|(Mt0[/t) 



TrMM^ 



^ TrBB'\x)(x\ = Mi^)M^ ^{^)MM^ 



X = Tr 



BB' 



\x){x\ 



TrMM^ ' 



(17) 



Thus \\X\\ = < T^-i— . Plugging in Eq. P) 

" " IrMAft — IrMMt btj to H VLj; 

gives 



-)C+S 



IXII < 



(18) 



Since Schmidt number can never increase by local oper- 
ations, we also have that 



rankX < d 



(19) 



The proof now follows from Eqs. (|l|), (|l|) and (|l|) 
and our results about significant subspaces. 

First define Pi £ B{TiA) to be the projector onto all 
eigenvectors of with eigenvalue of 2~"^ or greater. 
There can be no more than 2"^ such eigenvectors, so 
TrPi < 2"-^. From Lemma |, we have that 



- 2 4 



(20) 



where the last inequality holds as long as ng > 2500/3^. 
In terms of significant subspaces, we can summarize this 
with 



< TrPi < 2"^ 



(21) 



We will now use our bound on the rank of X from 
Eq. ( pj| ) to show that F also has a small significant sub- 
space. To accomplish this we will seek constants 5p and 
5t with the properties: 



• 5n 



5r + €<l 



• 5t > 4eo 

• Vn > 7io,5(p",(5p) > C2"^+"v^ where no depends 
on p and C does not. 

According to Proposition |[ this last condition is met by 
(5p = 0.95. To meet the other two, it will suffice to set 
e < eo = 0.01 and 5r = 0.04. 

Combining Sp + 5t + ^ < 1 with Eq. (|l^) and Proposi- 
tion H yields 

Sip"" ®T,5p + 6t) < 5(p"®F,l-e) 
< rank AT < d 



Applying Proposition ^ now gives 

5(p" «)F,,5p + <5r) 



< 



5(p»,<5p)-l 
d 

d 



(22) 



In the last line, the factors of +1 and —1 are negligible 
compared with the exponentials in the numerator and 
denominator, so we can absorb them into C . 

Define P2 to project onto the highest 5(r,(Sr) eigen- 
values of F. Then TrPaF > 5t and TrPa = 5(F,(5r) < 



Now we combine Eqs. (|l^), ( ^ ) and (22) to obtain 



Tr(Pi®P2)A: < TrPiTrPallATll 
< 2"^ 



C2"n/" 



(23) 



On the other hand. Pi and P2 project onto significant 
subspaces of p" and F respectively, so 



Thus 



Tr(Pi®P2)(p"(»F) > 



2eo > 2e> P>(p"®F,A:) 

> 2Tr(Pi(8)P2)(p"®F-X) 



(24) 



Solving for c -f s yields 



c + s> OL^fn, + log ( — - eo ) - log' C 



Sr 



(25) 



So there exist choices of a, ng, eg that prove the theorem. 



Remark: This bound only assumes 0(1) error; specif- 
ically Eq = 0.01. That is to say, even an entanglement 
dilution protocol with a non-negligible amount of loss of 
fidelity is still covered by Theorem |^, which is, therefore, 
a rather strong result. 

Remark: Suppose we are interested in o(l) error, a 
stricter requirement. It is not difficult to improve our 
classical communication bound to a;(-y/n); by analogy 
with data compression, achieving error e can be shown 
to require inefficiency and classical communication of 
0(a-\/n log(l/e)). See the Remark just after Proposition 
^ for the main modification needed in the proof. 
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VI. CONCLUSIONS AND DISCUSSIONS 



is positive whenever 



We have proven that entanglement dilution from any 
number of singlets to n pairs of bipartite partially entan- 
gled pure states necessarily requires f2(-\/n) bits of clas- 
sical communication, thus showing that the main result 
in is, in fact, optimal. 

A technique of our proof is a generalization of the re- 
duction result of [p^l in entanglement manipulation pro- 
tocols, showed that any exact pure state LOCC en- 
tanglement manipulation protocol could have two-way 
communication reduced to one-way communication. We 
extended their result to protocols ending in mixed states, 
and proved that Alice can be assumed to transmit her en- 
tire measurement outcome with no increase in communi- 
cation. This reduction applies rather generally and may 
be useful in deriving bounds on classical communication 
for other sorts of entanglement manipulations. 

An interesting, but difficult, problem is to try to derive 
similar bounds for transformations from a single copy of 
one state to a single copy of another, \ip2}- Here, 
there are no intermediate results between the constant 
lower bound of [0 and the construction of and 
which uses log rank Tr bits of communication. 

A more specific, and perhaps more tractable, problem 
is that of converting between many copies of different 
partially entangled states. Suppose we constrain these 
sorts of interconversions to waste only o(n) bits of entan- 
glement. Recall from |16 that dilution can be performed 
with 0{^/n) bits of classical communication (and that 
shows how to reduce this cost by a factor of 2). Our 
lower bound in Theorem ^ matches this construction up 
to a constant factor. If we could improve either our lower 
bound or the protocol of |l^ to eliminate this con- 
stant factor, then we could prove that, at least for some 
partially entangled states \tpi) and \ip2), preparing IV'2)" 

from requires ^l{^/n) bits of communica- 

tion. 

To see this, consider the following two processes. Pro- 
cess A: Start from singlets, dilute to n copies of |'i/;2) 
directly. Process B: Start from singlets, dilute first to 
n ^l^^j + o{n) copies of 1-01) and then apply a conver- 
sion procedure from the n ^|^^| -|- o(n) copies of to 
n copies of \ip2)- From Theorem 0, Process A takes 
ri(Q!^2V^) bits of classical communication. Process 
B is a way of realizing Process A and is, thus, con- 
strained by our lower bound. Therefore, it must also take 
ri(a^2 V"-) bits of classical communication. Now, 
showed that Part 1 of Process B can be done with only 

O ^a^i bits of classical communication. If the 

constant factors of Theorem |l| and |l6| were the same 
then the classical communication cost of Part 2 of Pro- 
cess B would be lower-bounded by an amount of that is 
at least the difference between these two numbers, which 



> 



(26) 



This would establish a total ordering on entangled states. 
If, instead, we were unable to close the gap between the 
constants of the upper and lower bounds, then we would 
have a partial ordering; for some constant C > 1, convert- 
ing to \'ip2) would require communication whenever 

It is an open question whether Eq. ( p6| ) is a neces- 
sary condition for a classical communication bound or 
whether such a result holds for more general pairs of 
partially entangled states. Also, unlike the case of di- 
luting from maximally entangled states, there may be a 
tradeoff between inefficiency and communication when 
starting with partially entangled states. 

We can also apply our main theorem to the resource 
model of quantum information processing, where it im- 
plies that there is a limited extent to which the resource of 
entanglement can be thought of as independant of form. 
Thus, when one considers scenarios with prior shared en- 
tanglement, one should either a) restrict the scenario to 
o(ri^) copies of any partially entangled state (if the proto- 
col calls for 0{n) bits of classical communication) or b) 
specify explicitly what forms of bipartite entanglement 
are allowed. 

More generally, the classical communication cost in 
quantum information processing is an important subject 
[Q. One rather curious fact about our result is that 
dilution requires a large amount of classical communica- 
tion, but there does not appear to be any simple way to 
use it for signaling. It would be interesting to determine 
whether any black box capable of performing entangle- 
ment dilution could also be used to transmit information, 
as this would provide an intuitive alternate proof of our 
main result. 

Two final remarks are in order. First, the classical bits 
transmitted in entanglement dilution constitute classical 
shared randomness between Alice and Bob. Such clas- 
sical shared randomness can be an important resource 
in information processing. However since those bits are 
sent through a classical channel, their value is potentially 
public knowledge. Thus, we can call them shared public 
randomness. Second, the loss of entanglement in, for 
example, entanglement concentration will generally give 
rise to shared randomness that is private to Alice and 
Bob. Similarly, the loss of entanglement in entanglement 
dilution will give rise to residual correlations that are al- 
most uncorrelated to the desired final state (according 
to Eq. (p^)). Such randomness is private to Alice and 
Bob, and could therefore be used for applications such 
as a cryptographic one-time-pad . We believe that a 
complete theory of the resource model of quantum infor- 
mation processing should take full account of these two 
resources — shared public randomness and shared private 
randomness. 
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After the completion and circulation of a draft version 
of the current paper, we became aware of the indepen- 
dent proof of almost the same result but with a different 
approach by Patrick Hayden and Andreas Winter S. 



where F{po,pi) = Ti: y/ y/poPiy/po is the fidelity. By 
Uhlmann's theorem [E3| 



F{Po,Pi) = max |((/?o|</'i>| 



(A2) 
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APPENDIX A: PROOF OF LEMMA @ 

Proof For any density matrices po ^^nd pi , Eq. (46) of 
IpI states that 



where (po and ipi are purifications of pg and pi, respec- 
tively. Equivalently we can fix an arbitrary purification 
(po and maximize only over ipi. 

Applying these two results we find that there exists 
1 7) E Ti-B such that 



|(^|(|0)®l7»l>l-e/2 



(A3) 



since we can consider \(f)) (8) I7) to be a purification of 

To obtain the trace distance between these states, 
write 10) ® I7) as a\ip) + blip-^}, where |a| > 1 — e/2 and 
\b\ — — |ap < i/e — 6^/4. The trace distance is then 
given by 



Tr 



l-|a|2 




2|6|2 < 2e- — < 2e (A4) 
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yield dilution protocols since the proof is highly insensi- 
tive to the protocol's success probability. 
This well-known result is a consequence of our Proposi- 
tion ^ 

The Church of the Larger Hilbert space is a fruitful idea 
in quantum information theory. It has been applied in, 
for example, the proof of the impossibility of quantum 
bit commitment [ p^ , ^ and quantum oblivious transfer 
p3| . For a review, see, for example, [^ 
This means that to guarantee 0(1) error, Schumacher 
compression (and hence the dilution protocol given by 
BBPS) must have S7(-yn) inefficiency. We can use a simi- 
lar argument to prove that this inefficiency bound applies 
to any dilution protocol. 

However, Alice and Bob also have the option of keep- 
ing the ancillary state quantum, rather than measur- 
ing it and generating classical randomness. According to 
Lemma 1^, this ancillary state is nearly separable from the 
output of the dilution procedure. Thus, Alice and Bob 
could consider keeping their ancillas and later recycling 
some of its entanglement. 



